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PROCESSOR INTERFACE 



The present invention relates to a processing system, a data handling system and a 
stream register, which are in particular but not exclusively for use with a peripheral device. 

It is known in the art to provide a system including a processor, such as a Central 
Processing Unit (CPU) that is capable of running a computer program. During running of 
the program, it is likely that data will be required by the program from a number of 
sources. It is usual for the majority of the required data to be obtained from the main 
memory of the system via a memory bus. It is also common for some data to be required 
from one or more peripheral devices, connected to the CPU via a memory bus. 

In order for the CPU to obtain data from a peripheral device, a request signal is sent to the 
memory bus. The signal then passes from the bus to the peripheral device, wherein the 
requested data is retrieved from the peripheral device and sent, via the memory bus, to 
the CPU. 

One problem with the known system is that the peripheral device may not have the 
necessary data exactly when requested. In which case, for correct operation, the 
peripheral must stall returning of the data accessed until the data is available. During this 
delay, which may be highly variable, or in the case of a fault, indefinite, the CPU is unable 
to proceed with any processing, particularly of pending interrupts, until the correct data is 
returned. It would be desirable to provide a system capable of processing interrupts in a 
short and determinate time. 

Another problem is that if the delay is indefinite, it is inconvenient for the user of the 
system, since in order to recover the entire system must be reset. One way of avoiding 
this is to include polling loops in the program code at each point in the code where data 
from the peripheral device is requested, but the problem with this is that it makes coding 
more complicated and slows down the rate at which data can be read. In addition, while 
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the CPU is actively polling the peripheral, it is using both memory bus bandwidth and 
power. 

It would be desirable to mitigate the problems of the known system. 

According to a first aspect of the present invention there is provided a processing system 
for accessing first and second data types, the first data type being data supplied from a 
peripheral and the second data type being randomly accessible data held in a data 
memory, the processing system comprising : a processor for executing instructions; a 
stream register unit connected to supply data from the peripheral to the processor; a FIFO 
connected to receive data from the peripheral and connected to the stream register unit by 
a communication path, along which the said data can be supplied from the FIFO to the 
stream register unit; and a memory bus connected between the data memory and the 
processor, across which the processor can access the randomly accessible data. 

According to a second aspect of the present invention, there is provided a streaming data 
handling system, comprising : a processor; a stream register associated with the 
processor; a FIFO memory connected to the processor via the stream register, wherein 
the stream register and the FIFO operate the same data handling protocol such that the 
stream register can receive streamed data items from the FIFO memory and supply them 
to the processor in the received order. 

According to a third aspect of the present invention, there is provided a stream register 
connectable between a processor and a peripheral, the stream register comprising : a 
receiver arranged to receive a request for a data item from the processor; and a stream 
engine, arranged to send the request to the peripheral and receive one or more signals 
back from the peripheral indicating availability of the requested data item, and, if the data 
item is available, send the data item to the processor and if the data item is not available, 
send a timeout signal to the processor. 
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According to a fourth aspect of the present invention, there is provided a stream register 
connectable between a processor and a peripheral, the stream register comprising : a 
receiver arranged to receive a request for a data item from the processor; and a stream 
engine, arranged to send the request to the peripheral and receive one or more signals 
back from the peripheral indicating availability of the requested data item, and, if the data 
item is available, send the data item to the processor and if the data item is not available, 
send a stall signal to the processor. 

Embodiments of the invention will now be described, by way of example only, with 
reference to the accompanying drawings in which : 

Figure 1 shows a processing system in accordance with a first embodiment. 
Figure 2 shows part of the system of figure 1 ; 

Figure 2a shows a slightly modified version of the part of the system shown in figure 2 
Figure 3 shows the part of the system of figure 2a, modified in accordance with a second 
embodiment; 

Figure 4 shows part of the system modified in accordance with the second embodiment. 
In the figures like reference numerals indicate like parts. 

Referring firstly to figure 1, there is shown a system, indicated generally by reference 
numeral 1. The system comprises a CPU 2 connected to a memory bus 3. Connected to 
the memory bus 3 are each of a main memory 11, a first peripheral device 13 and a 
second peripheral device 15. 

The CPU 2 is additionally connected directly to a third peripheral device, which in this 
case is a video decoder (VDC) 4, via a communication channel 8. 

Turning now to figure 2, a detail of part of the system 1 of figure 1 is shown. The CPU 2, 
the memory bus 3, the communication channel 8, and the VDC 4 are visible. The VDC 4 
receives video data in at arrow 12 from an external source such as a satellite transmitter. 
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The CPU 2 has a stream register unit 5, a register file 26 containing a plurality of reg.sters, 
a load/store unit 28 and an execution unit 27 which runs programs. The stream reg.ster 
unit 5 provides the data required by the execution unit 27 from the VDC 4. It has a 
receiver 29 for receiving requests for data from the execution unit 27. 

The VDC 4 comprises a source process 14 to which the video data coming in passes. 
This source process 14 is connected to issue data to a "first-in-first-ouf (FIFO) memory 
16. The channel 8 is specifically connected to the VDC 4 at the FIFO 16. 

In operation, the execution unit 27 of the CPU 2 runs a program, which it retrieves from 
the main memory 11 as code in the form of a list of instructions. Many of those 
instructions require data from the main memory 11, to be retrieved in a known manner. 
However, some instructions require video data from the VDC 4. In this embodiment, the 
stream register unit 5 of the CPU 2 receives data from the VDC 4 without the need to go 
via the bus 3. It does need to send requests for data through the communication channel 
8 but this channel 8 is merely a data transfer medium. 

The channel 8 works by communicating data in order, i.e. the n-th data item 
communicated will arrive after the (n - 1)th item and before the (n + 1)th data item. Thus 
the execution unit 27 makes n requests for n items of data and these requests are passed 
in the requested order over the channel 8 and the data is retrieved from the FIFO 16 of the 
VDC 6 and returned to the CPU 2 in that order. This process makes use of the standard 
"data" "valid" and "taken" signals used by FIFOs (the taken and valid signals are 
sometimes known as "request" and "grant" signals). This process is explained in the 
following paragraphs with reference to an item of data. 

The execution unit 27 makes a request for an item of data to the load/store unit 28 (arrow 
37) This request is directed to a receiver 29 of the stream register unit 5 (arrow 38). If the 
stream register unit 5 contains data this is returned to the load/store unit 28 (arrow 40), 
which temporarily stores the data in the register file 26 (arrow 42), from where it .s 
retrieved by the execution unit 27 (arrow 44). If no data is available a stall signal is sent 
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from the stream register unit 5 to the execution unit 27 (arrow 46), and the execution unit 
27 stalls pending data appearing in the stream register unit 5. 

If the stream register unit 5 contains no data, the "request" (or "taken") signal is put to logic 
high across the communication channel 8 from the stream register unit 5 to the VDC FIFO 
16. If the "grant" (or "valid") signal from the VDC FIFO 16 to the stream register unit 5 is in 
response set to logic high at the same time, data is transferred from the FIFO 16 to the 
stream register unit 5. In this way the next item in the FIFO 16 at the end of the 
communication channel 8 is transferred to the stream register unit 5, and thus video data 
is transferred from the VDC 4 to the stream register unit 5, ready for reading by the 
execution unit 27. 

Since data needs to be received externally by the VDC 4 and processed in the source 
process 14 before being transferred to the FIFO 16 and hence being made available to be 
sent to the CPU 2, there is a danger that there may be a delay between the execution unit 
27 of the CPU 2 requesting the data and it being available. Thus if the data item 
requested by the execution unit 27 is not in the next location of the FIFO 16, the FIFO 16 
returns a valid signal low, to indicate to the CPU 2 that the data item is not there and in 
this case the data signal does not contain the data item. In this situation, because of the 
requirement to retrieve data in order, the data request and retrieve process is stopped and 
the execution unit 27 of the CPU 2 consequently stalls. 

Often, the data item will become available in a small number of clock cycles. When this 
happens, the valid signal is switched high and the data is sent to the CPU 2. The 
execution unit 27 of the CPU 2 can then carry on running the program. Since the 
execution unit 27 of the CPU 2 stalls when data is not available and does not take 
different data from the FIFO 16, no data can be iost from the system 1, and no 
complicated software is required because everything is controlled by the stream register 
block 5. Furthermore, since the CPU 2 connects directly to the FIFO 16, data can be input 
to the CPU 2 without the need for slow accesses across a memory bus. This method 
automatically deals with flow control. 
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Reference is now made to figure 2a. To improve .he flow of data through the system the 
stream register unit 5 can atso contain a small FIFO 17. The taKen signal is set to * 
high across the communication channel 8 while the stream register FIFO 17 ,s not full. 
This allows data to be taken at a higher rate from the stream register unit 5. Furthenmore 
data may be taken from the stream register uni. 5 for use by the execution unit 27 at a 
slightly different rate from that at which it is being transmitted to the VDC 
combination of the VDC FIFO 16, communications channel 8 and stream reg,ster FIFO 17 
allowing these communications to flow smoothly. 

The stream register unit 5 in this embodiment has a 32 bit wide port, bu, it could have a 
different data width should that be appropriate. However, the data passmg through the 
channel 8 does no. need to have mis same date width. For example. In 
the VDC 4 peripheral stores 16 bit data items, therefore data is sent to the CPU 2 as 
single 16 bl. items. A pair of 16 bi. values could be written as a packed word, which would 
be twice as fast. 

Whilst the system of figures 1, 2 and 2a is advantageous in lhat it is simple, avoids the 
need for complicated software and provides an improvement in the speed of returmng 
date ,o the CPU 2 over some known systems, ft can softer from a couple o 
disadvantages. Firstly. If a data item requested by the execution on,. 27 of ft* > CPU2 
does no. become available during a large number of clock cycles, say about 10 000. there 
is a risk that such an excessive delay will eventually result in irrecoverable staling of the 
execution unit of the CPU 2. due to there being no control mechanism to deal with th,s 
situation This would result in the user of the system needing to switch me CPU 2 off and 
on again to allow the CPU 2 to reset. Another consideration is that there is no mechanism 
for signalling an interrupt to the execution unit 27 of the CPU 2 while ft is stalled. If an 
interrupt were able .o be signalled by a timeout mechanism, ft would allow me execufion 
unit 27 of the CPU 2 to perform other tasks while i. wafted for .he missing data item, whrch 
would increase overall efficiency of .he system 1. To deal with these issues, a second 
embodiment, is shown in figure 3. 



7 



Figure 3 shows the equivalent part of the system 1 as is shown in figure 2, but including 
additionally a timeout generator 6 within the stream register unit 5. The timeout generator 
6 detects the situation when the execution unit 27 of the CPU 2 is stalled awaiting a data 
item to be received from the communication channel 8 and the stream register FIFO 17 is 
empty for a programmable prolonged period. This raises a TIMEOUT signal which causes 
the execution unit 27 to stop attempting to receive data from the stream register unit 5 and 
to start execution of an exception handler dedicated to the stream register unit 5 timeout 
process. This exception handler means that the application of a system reset can be 
avoided since any possible deadlock associated with the interface is detected by the 
exception handler. 

Furthermore, since the control of communications with the VDC 4 is entirely contained 
within the stream register unit 5, it can be arranged that a stall associated with an access 
to the stream register unit 5 by the execution unit 27 can be interrupted by another 
requestor other than the stream register unit 5 without altering the stream register state 
between the execution unit 27 and the stream register unit 5. Therefore another requestor 
can raise a timeout signal, allowing the execution unit 27 to perform other tasks during a 
stall caused by lack of data availability in the stream register unit 5. 

The signalling involved in execution of the second embodiment will now be described with 
reference to figure 4. 

Figure 4 shows some components of the system 1 used in the second embodiment. The 
channel 8 is shown connected to the VDC 4 and to the CPU 2 as before. This channel 8 
is used to receive and process the data, valid and taken signals described above. The 
CPU 2 contains the stream register unit 5, the register file 26, the load/store unit 28 and 
the execution unit 27. The CPU 2 Is additionally shown to be connected to the main 
memory 1 1 via the memory bus 3. The execution unit 27 makes requests for data via the 
load/store unit 28 from either the main memory 1 1 via the memory bus 3, peripherals such 
as first and second peripherals 14, 15 via the memory bus 3 (see figure 1), or peripherals 
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such as the VDC 4 connected to the stream register unit 5. These requests are placed in 
the register file 26 for use by the execution unit 27. 

,„ operation, as before, the execuflon unit 27 makes a reguest for n items of data, via the 
,„ad,s.ore unit 28 (anov, 37) to the receiver 29 of the stream register unit 5 (anew 3* 
Such a reguest can be initiated, for example, from C program code as an access tea 
volatile variable. Dafa Is taken from fhe next locaaon In tee stream register ^ .7 rem 
^ch date should be taken, the FIFO 17 having received dafa from ft. VDC 4 as 
described previously. As explained above, data is forwarded automatically from the VDC 
FlFO 16 to the stream register FIFO 17 using .he data, valid, taken signals, as descnbed 
previously. In .his case .he stream register block 5 has received me date witeou. delay. 
L VDC 4 having se. high the date and valid signals (and sen. the date item) to .he 
stream register FIFO 17 . Thus the load/store unit 28 of .he CPU 2 receives me data ,.em 
from me stream register FIFO 17 (arrow 40) and stores if temporarily in the register file 26 
(arrow 42), from where the execution unit 27 can retrieve it (arrow 44). 

This process is repeated for each of the n Items of date, untess and until an item of date is 
no. ready to be taken from the stream register FIFO 17. This eventuality is descnbed ,n 
the following paragraph. 

„ a date item requested by the execution unit 27 of tee CPU 2 is no. in the nex, location in 
,he stream register FIFO 17. the stream register 5 sends a stel, signal to tee execute* , u 
27 (arrow 46), thus causing the execution unit 27 to stall running of .he program. If the 
oafa item remains unavaiiable and tee execution unit 27 stalled for a P"-*™- ^ 
,he timeout generator 6 of tee stream register unit 5 returns a TIMEOUT Signal te tee 
execution uni, 27 (arrow 48). This signal falls fhe execution uni. 27 tea. I. can ,n errup, tee 
program and perform Cher tasks. Therefore even though fhe program is steHad, tee 
execution unit 27 of tee CPU 2 Is still capable of servicing interrupts and hence 
pTlnce o, tee system 1 is maintained. I. w,„ be understood teat tee fimeou 
generator 6 of the stream register block 5 operates in such a way .hat it has ma e tee 
execute*, unit 27 stel, before if becomes aware tea. if is no. going to rece,ve tee data that 
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it has requested. This is because the TIMEOUT signal 48 is sent in response to the 
execution unit 27 attempting to access the data from the register file 26. The danger of an 
irrecoverable stall is thus avoided. 

The predetermined time after which the TIMEOUT signal is sent from the timeout 
generator 6 to the execution unit 27 is a programmable value, termed the "timeout 
constant". 

When the data that the execution unit 27 has requested becomes available, the valid and 
data signals from the VDC 4 are switched high. This is in turn communicated across the 
communications channel 8 and data is transferred from the VDC FIFO 16 to the stream 
register FIFO 17. The stall signal from the stream register 5 to the execution unit 27 
(arrow 46) is taken low to un-stall the execution unit 27. 

The timeout is cleared explicitly by the interrupt program writing an instruction to the 
stream register unit 5 to clear the timeout signal. The interrupt program will write this 
instruction at a convenient point in the task being performed by the execution unit 27 
during stalling of the program. Once the timeout signal has been taken low, running of the 
program continues. 

If during this time the execution unit 27 has been interrupted to perform another task, any 
data is buffered in the stream register FIFO 17 ready for retrieval by the execution unit 27 
at a later time. This continues until the stream register FIFO 17 becomes full, in which 
case it puts the taken signal to the communication channel 8 to logic low. This will prevent 
more data being sent from the VDC FIFO 16 to the stream register FIFO 17. 

It will be understood by those skilled in the art that the above-described system is more 
efficient than some known systems for another reason, namely that it allows the 
availability of each data item to be tested individually. When a CPU is attempting to 
access peripheral data over a bus, it is usual to test blocks of data, which is more time 
consuming than the testing of a single data item as in the present system. The system 
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also reduces bus traffic. It also avoids the need to write polling loops into the program, 
thus avoiding complexities in coding. 

The above-described system works equally well with a number of FIFOs linked together in 
either or both of the VDC 4 or the stream register unit 5. This might be useful, for 
example if the channel 8 needs to store some of the data which it is transferring. The 
above-described signals would simply be routed through the chain of FIFOs and the 
signals between the stream register block 5 and the execution unit 27 would remain the 
same The data, valid and taken signals are augmented appropriately by each FIFO in the 
chain guaranteeing data ordering and integrity. It will be understood that the first 
embodiment described with respect to figure 1 would also function with a number of l.nked 
FIFOs. 

The handshake signalling system described using FIFO data, valid and taken signals 
using logic level high for an active signal and logic level low for an inactive signal ,s not a 
requirement for operation of the invention. The invention would work equally well with 
other handshake mechanisms. 

It will also be appreciated that a similar system could be used for the reverse situation, 
namely the CPU 2 writing data to the VDC 4. This would allow delays in writing of data to 
the VDC 4 caused by lack of memory space to be buffered from the CPU 2. In addition it 
would allow the CPU 2 to produce data at a different rate from that at which it is consumed 
by the VDC 4, without the need for complex circuitry. 
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CLAIMS 

1. A processing system for accessing first and second data types, the first data type being 
data supplied from a peripheral and the second data type being randomly accessible data 
held in a data memory, the processing system comprising : 

a processor for executing instructions; 

a stream register unit connected to supply data from the peripheral to the 
processor; 

a FIFO connected to receive data from the peripheral and connected to the stream 
register unit by a communication path, along which the said data can be supplied from the 
FIFO to the stream register unit; and 

a memory bus connected between the data memory and the processor, across 
which the processor can access the randomly accessible data. 

2. A processing system according to claim 1, wherein the stream register unit forms part 
of the processor. 

3. A processing system according to claim 1 or claim 2, wherein data is supplied from the 
FIFO to the stream register unit in accordance with requests for data made by the 
processor to the stream register unit and forwarded to the FIFO. 

4. A processing system according to claim 3, wherein the said requests are made as 
accesses to volatile variables. 

5. A processing system according to claim 3 or claim 4, wherein, the FIFO is arranged to, 
upon receiving a request for data from the stream register unit, send a signal to the stream 
register unit indicating availability of the requested data. 

6. A processing system according to claim 5, wherein if the FIFO contains the requested 
data, the said signal to the stream register unit indicates that the data is available, and the 
FIFO is further arranged to send a signal to the stream register unit comprising the data. 
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7 A processing system according ,o Calm 6. where.-, the stream register unit is ^arranged 

8 a processing system according to Cairn 6 or Cairn 7. wherein the stream 
Inge^, Jewing receipt of the signs, compdsing ft. data, send a s,gna, fo the FIFO 
indicating that it has taken the data. 

g a processing system seconding to Cairn 8. wherein the said signal to the FIFO further 
indicates the next location in the FIFO from which data is recurred. 

register unit indicating that the data is not avaiiable. 
executing instructions. 

« — — -rrsrc.'tr— 

following sending of the satd ^ r eam register unit indicating that 

13 . A processing system according. « -^^^1^^ 
generator, arranged for common, *n-* the p = . ^ ^ 

and arranged «o. . the srgna en, flme ^ . ameout slgnal t0 
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14. A processing system according to claim 13, wherein if following sending of the 
timeout signal to the processor the data subsequently becomes available, the timeout 
generator is arranged to receive a signal instructing it to cease sending the timeout signal, 
and to, upon receipt of the said instruction, cease sending the timeout signal. 

15. A processing system according to claim 13 or claim 14, wherein the stream register 
unit is arranged to, if following sending of the timeout signal to the processor the data 
subsequently becomes available, send the data to the processor. 

16. A processing system according to any preceding claim, wherein the stream register 
unit is associated with a register file containing a plurality of registers and a load/store unit 
arranged to receive data from the stream register unit and temporarily store the data in the 
register file. 

17. A processing system according to claim 16, wherein the processor is arranged to 
retrieve data from the register file. 

18. A processing system according to claim 16 or claim 17, as dependent on claim 3, 
wherein the processor is further arranged to make requests for data to the stream register 
unit via the load/store unit. 

19. A processing unit according to any preceding claim, wherein the stream register unit 
comprises one or more FIFOs connected to receive data from the FIFO connected to the 
stream register and supply the data to the processor. 

20. A processing system according to any of claims 3 to 19, wherein the request for data- 
is a request for a single data item. 

21. A processing system according to any preceding claim, further comprising one or 
more additional FIFOs linked together between the said FIFO and the communication 
channel. 



14 



2 , A processing system according to any preceding claim, — n *. data from *. 
peripheral is video data. 

23 . A processing system according to Cairn 23. wherein .he peripherai - • *- 
processing system. 

24 . A processing sysfem subtly as herein .escribe. wffh reference to an. as 
shown in the accompanying drawings. 

25. A streaming data handling system, comprising : 

a processor; 

a stream register associated with the processor; 

t ^ tho nmcessor via the stream register, 
a F1FO memory connected to the data handling proloco , 

^i"^— = 

supply them to the processor in the received order. 

rH-.no to claim 25 further comprising the timeout generator as required 

c,rrr a r it- —» - - *— - - — 

register. 

2 , A stream raster conned between a processor an* a peripherai. fne stream 

register comprising : ^ ^ ^ processot; and 

a receiver arranged to recede a ^ ^ ^ ^ or 

:r,r:." p sand , he «. « . - - . - - - 

l8 no. available, send a timeout signal to the processor. 
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29. A stream register according to claim 28, wherein the stream engine is arranged to the 
interrupt signal to the processor after a predetermined period of time. 

30. A stream register according to claim 28 or claim 29, wherein the stream engine is 
further arranged to, if the data is available, temporarily store the data in a register file for 
access by the processor. 

31. A stream register according to any of claims 28 to 30, wherein the stream engine is 
further arranged to, following sending of the timeout signal to the processor, if the data 
item subsequently becomes available, receive a signal instructing it to cease sending the 
timeout signal, and to, upon receipt of the said instruction, cease sending the timeout 
signal to the processor and temporarily store the data in a register file for access by the 
processor. 

32. A stream register connectable between a processor and a memory, the stream 
register comprising : 

a receiver arranged to receive a request for a data item from the processor; and 
a stream engine, arranged to send the request to the memory and receive one or 
more signals back from the memory indicating availability of the requested data item, and, 
if the data item is available, send the data item to the processor and if the data item is not 
available, send a stall signal to the processor. 
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ABSTRACT 



A processing system for accessing first and second data types. The first data type is data 
supplied from a peripheral and the second data type is randomly accessible data held in a 
data memory. The processing system comprises a processor for executing instructions; a 
stream register unit connected to supply data from the peripheral to the processor; a FIFO 
connected to receive data from the peripheral and connected to the stream register unit by 
a communication path, along which the said data can be supplied from the FIFO to the 
stream register unit; and a memory bus connected between the data memory and the 
processor, across which the processor can access the randomly accessible data. 



Figure 2 
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